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(57) Abstract 

An encoder reorders quantized 
wavelet coefficients to cluster large and 
small wavelet coefficients into separate 
groups without requiring the use of 
data-dependent data structures. The 
coefficients are then adaptively encoded 
based on a run-length code which 
continuously modifies a parameter that 
controls the codewords uses to represent 
strings of quantized coefficients, seeking to 
minimize the number of bits spent in the 
codewords. A matrix of indices contains 
the coarsest coefficients in the upper left 
corner, and filling in low high and high low 
sub bands' in larger and larger blocks in an 
alternating manner, such that low high sub 
bands comprise the top of the matrix and 
the high low sub bands comprise the left 
side of the matrix. The shortest codewords 
are assigned to represent a run of the most 
likely character having length of 2 k , where 
k is a parameter, k is adjusted based on 
successive characters being encountered, 
k is increased when the character is the 
same, and decreased when the character 
is different. A decoder applies the above 
in reverse order. 
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LOSSLESS ADAPTIVE ENCODING OF FINITE ALPHABET DATA 

Field of the Invention 

This invention relates generally to the field of image compression and in 
5 particular to an improved wavelet encoding and decoding of digital pictures. 

Reference to Related Applications 
This application is related to co-pending applications having attorney 
docket numbers 777.264US1 entitled Image Encoding Using Reordering and 
Blocking of Wavelet Coefficients Combined with Adaptive Encoding and 
10 777.265US1 entitled Reordering Wavelet Coefficients For Improved 

Encoding and assigned to the same assignee as the present application and filed 
on the same day herewith and incorporated by reference. 

Copyright Notice/Permission 
A portion of the disclosure of this patent document contains material 
15 which is subject to copyright protection. The copyright owner has no objection 
to the facsimile reproduction by anyone of the patent document or the patent 
disclosure as it appears in the Patent and Trademark Office patent file or records, 
but otherwise reserves all copyright rights whatsoever. The following notice 
applies to the software and data as described below and in the drawing hereto: 
20 Copyright © 1998, Microsoft Corporation, All Rights Reserved. 

Background 

Digital pictures are used in many applications, such as Web pages, CD- 
ROM encyclopedias, digital cameras, and others. In most cases is necessary to 
compress the pictures, in order for them to fit into a small amount of storage or 

25 to be downloaded in a short amount of time. For example, in a typical digital 

camera, pictures are taken at a resolution of 1024x768 picture elements (pixels), 
with a resolution of 12 to 24 bits per pixel. The raw data in each image is 
therefore around 1.2 to 2.5 megabytes. In order to fit several pictures in a 
computer diskette, for example, it is necessary to reduce the amount of data used 

30 by each picture. The larger the compression ration that is achieved, the more 
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pictures will fit into a diskette or memory card and the faster they can be 
transferred via bandwidth limited transmission medium such as telephone lines. 

Image compression has been extensively studied over the past twenty 
years. The JPEG standard, defined by the JPEG (joint photographic experts 
5 group) committee of ISO (International Standards Organization), was defined in 
1992 and is the most popular method of compressing digital pictures. In JPEG, 
small square blocks of pixels (of dimensions 8x8) are mapped into the frequency 
domain by means of a discrete cosine transform (DCT). The DCT coefficients 
are quantized (divided by a scale factor and rounded to the nearest integer) and 

10 mapped to a one-dimensional vector via a fixed zigzag scan pattern. That vector 
is encoded via a combination of run- length and Huffman encoding. 

The independent processing of small 8x8 blocks in JPEG is an advantage 
from an implementation viewpoint, especially in low-cost hardware. However, 
it also leads to the main problem with JPEG: blocking artifacts. Because the 

15 quantization errors from adjacent blocks are uncorrected among blocks but 

correlated within the blocks, the boundaries of the 8x8 blocks becomes visible in 
the reconstructed image due to the potential difference in encoding between 
adjacent blocks. Such artifacts are referred to as tiling or blocking artifacts which 
can be reduced (but not completely eliminated) by using transforms with 

20 overlapping basis functions. 

An efficient way to remove the blocking artifacts is to replace the block 
DCT by a wavelet decomposition, which provides an efficient time-frequency 
representation. Very good compression performance can be obtained by 
quantizing and encoding wavelet coefficients. 

25 Many wavelet-based image compression systems have been reported in 

the technical literature in the past few years. With wavelets it is possible to 
achieve compression ratios that typically range from 20% to 50% better than 
JPEG. More importantly, wavelet transforms lead to pictures that do not have the 
disturbing blocking artifacts of JPEG. Therefore, wavelet-based transforms are 

2 
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becoming increasingly popular. In fact, in the next revision of JPEG, named 
JPEG2000, all proposals under consideration use wavelets. 

Some prior wavelet transforms decompose images into coefficients 
corresponding to 1 6 subbands. This results in a four by four matrix of subbands, 
5 referred to as a big block format, representing spectral decomposition and 
ordering of channels. The letters L and H are used to identifying low pass 

i 

filtering and high pass filtering respectively for each subband. The first subband 
comprises LL and HL coefficients, where the first letter in each set correspond to 
horizontal filtering and the second corresponds to vertical filtering. Two stages 
10 are used in each subband filtering combination. The ordering corresponds to 

frequencies increasing from left to right and from bottom to top. This ordering is 
fixed to allow both encoding and decoding to function in a fixed manner. 
Quantization of the coefficients is then performed, followed by some form of 

•? 

compressive encoding of the coefficients, including adaptive Huffman encoding 

. «< 

1 5 or arithmetic encoding to further compress the image. These forms of encoding 
can be quite complex, including zero tree structures, which depend on the data 
types. These encoders are fairly complex, and many need to be modified for 
different images to be compressed, making them difficult to implement in 
hardware. 

20 There is a need for a simple encoding technique that works on wavelet 

compression coefficients and similar finite alphabet data which may be 
implemented in either hardware or software. 

Summary of the Invention 
Adaptive encoding is performed on signed integer data for which values 

25 of smaller magnitude are more likely to occur than those with higher 
magnitudes. The encoding is performed in bit planes, which allows for 
scalability in precision of reconstruction, from lossless (no errors) to various 
levels of approximated reconstruction. Unlike Huffman encoding, code tables 
are not necessary because simple rules determine the code words from the input 

30 strings. 
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In one form, for each bit plane, the shortest code word (a single 0) is 
assigned to represent a run of the most likely input, zero, having length 2 k , where 
k is a parameter that controls the codewords used to represent strings of 
quantized coefficients, seeking to minimize the number of bits spent in the 
5 cocewords. K is adapted to increase when longer runs are encountered, and to 
decrease otherwise, e.g., when symbols different from those in the run are 
encountered. The encoding of the bit planes can be done with any efficient 
entropy encoder, such as an adaptive arithmetic encoder, but in one 
implementation, a new adaptive Run-length plus Golomb-Rice encoder is used. 
10 By not requiring the use of data-dependent data structures such as 

zerotrees, or a separate list for set partitions in trees, hardware implementations 
are easier to build and software implementations may run faster. 

Brief Description of the Drawings 
Figure 1 is a block diagram of a computer system on which the present 
15 invention may be implemented. 

Figure 2 is a block diagram of an encoder that reorders wavelet 

coefficients and encodes then in a lossless adaptive manner. 
Figure 3 is a block diagram of a decoder that decodes and unshuffles the 
encoded coefficients produced by the encoder of Figure 2. 
20 Figure 4 is a block diagram of the reordered wavelet coefficients produced 
by the encoder of Figure 2. 
Figure 5 is a flow chart showing high level operation of the coefficient 
encoder of Figure 2, separating the coefficients into bit planes. 
Figure 6 is a flow chart showing further detail of the operation of the run- 
25 length adaptive encoder of Figure 2. 

Figure 7 is a flow chart showing the writing of a matrix of coefficients in a 

reordered manner consistent with that shown in Figure 4. 
Figure 8 is a block diagram showing the use of the encoder of Figure 2 and 
the decoder of Figure 3 in a software application suite which 
30 handles image data. 
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Detailed Description 

In the following detailed description of exemplary embodiments of the 
invention, reference is made to the accompanying drawings, which form a part 
hereof, and in which is shown by way of illustration specific exemplary 
5 embodiments in which the invention may be practiced. These embodiments are 
described in sufficient detail to enable those skilled in the art to practice the 
invention, and it is to be understood that other embodiments may be utilized and 
that logical, mechanical, electrical and other changes may be made without 
departing from the spirit or scope of the present invention. The following 

10 detailed description is, therefore, not to be taken in a limiting sense, and the 
scope of the present invention is defined only by the appended claims. 

The detailed description is divided into multiple sections. A first section 
describes the operation of a computer system that implements the current 
invention. This is followed by a high level description of a fixed reordering of 

15 quantized wavelet coefficients and adaptive run-length encoding of them. A 
decoder for such encoded data is also described. Further detail of selected 
blocks of the high level description is then described by use of flowcharts. This 
is followed by a general description of the use of such encoders and decoders in 
an office suite of software applications. A conclusion describes some potential 

20 benefits and describes further alternative embodiments. 

Hardware and Operating Environment 
Figure 1 provides a brief, general description of a suitable computing 
environment in which the invention may be implemented. The invention will 
hereinafter be described in the general context of computer-executable program 

25 modules containing instructions executed by a personal computer (PC). Program 
modules include routines, programs, objects, components, data structures, etc. 
that perform particular tasks or implement particular abstract data types. Those 
skilled in the art will appreciate that the invention may be practiced with other 
computer-system configurations, including hand-held devices, multiprocessor 

30 systems, microprocessor-based programmable consumer electronics, network 
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PCs, minicomputers, mainframe computers, and the like which have multimedia 
capabilities. The invention may also be practiced in distributed computing 
environments where tasks are performed by remote processing devices linked 
through a communications network. In a distributed computing environment, 
5 program modules may be located in both local and remote memory storage 
devices. 

Figure 1 shows a general -purpose computing device in the form of a 
conventional personal computer 20, which includes processing unit 21, system 
memory 22, and system bus 23 that couples the system memory and other 

10 system components to processing unit 21 . System bus 23 may be any of several 
types, including a memory bus or memory controller, a peripheral bus, and a 
local bus, and may use any of a variety of bus structures. System memory 22 
includes read-only memory (ROM) 24 and random-access memory (RAM) 25. 
A basic input/output system (BIOS) 26, stored in ROM 24, contains the basic 

15 routines that transfer information between components of personal computer 20. 
BIOS 26 also contains start-up routines for the system. Personal computer 20 
further includes hard disk drive 27 for reading from and writing to a hard disk 
(not shown), magnetic disk drive 28 for reading from and writing to a removable 
magnetic disk 29, and optical disk drive 30 for reading from and writing to a 

20 removable optical disk 31 such as a CD-ROM or other optical medium. Hard 
disk drive 27, magnetic disk drive 28, and optical disk drive 30 are connected to 
system bus 23 by a hard-disk drive interface 32, a magnetic-disk drive interface 
33, and an optical-drive interface 34, respectively. The drives and their 
associated computer-readable media provide nonvolatile storage of computer- 

25 readable instructions, data structures, program modules and other data for 

personal computer 20. Although the exemplary environment described herein 
employs a hard disk, a removable magnetic disk 29 and a removable optical disk 
31, those skilled in the art will appreciate that other types of computer-readable 
media which can store data accessible by a computer may also be used in the 

30 exemplary operating environment. Such media may include magnetic cassettes, 
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flash-memory cards, digital versatile disks, Bernoulli cartridges, RAMs, ROMs, 
and the like. 

Program modules may be stored on the hard disk, magnetic disk 29, 
optical disk 31, ROM 24 and RAM 25. Program modules may include operating 

5 system 35, one or more application programs 36, other program modules 37, and 
program data 38. A user may enter commands and information into personal 
computer 20 through input devices such as a keyboard 40 and a pointing device 
42. Other input devices (not shown) may include a microphone, joystick, game 
pad, satellite dish, scanner, or the like. These and other input devices are often 

10 connected to the processing unit 21 through a serial-port interface 46 coupled to 
system bus 23; but they may be connected through other interfaces not shown in 
Figure 1, such as a parallel port, a game port, or a universal serial bus (USB). A 
monitor 47 or other display device also connects to system bus 23 via an 
interface such as a video adapter 48. In addition to the monitor, personal 

1 5 computers typically include other peripheral output devices (not shown) such as 
speakers and printers. 

Personal computer 20 may operate in a networked environment using 
logical connections to one or more remote computers such as remote computer 
49. Remote computer 49 may be another personal computer, a server, a router, a 

20 network PC, a peer device, or other common network node. It typically includes 
many or all of the components described above in connection with personal 
computer 20; however, only a storage device 50 is illustrated in Figure 1 . The 
logical connections depicted in Figure 1 include local-area network (LAN) 5 1 
and a wide-area network (WAN) 52. Such networking environments are 

25 commonplace in offices, enterprise-wide computer networks, intranets and the 
Internet. 

When placed in a LAN networking environment, PC 20 connects to local 
network 5 1 through a network interface or adapter 53. When used in a WAN 
networking environment such as the Internet, PC 20 typically includes modem 
30 54 or other means for establishing communications over network 52. Modem 54 

7 
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may be internal or external to PC 20, and connects to system bus 23 via serial- 
port interface 46. In a networked environment, program modules, such as those 
comprising Microsoft® Word which are depicted as residing within 20 or 
portions thereof may be stored in remote storage device 50. Of course, the 
5 network connections shown are illustrative, and other means of establishing a 
communications link between the computers may be substituted. 

Software may be designed using many different methods, including 
object oriented programming methods. C++ and Java are two examples of 
common object oriented computer programming languages that provide 

10 functionality associated with object oriented programming. Object oriented 

programming methods provide a means to encapsulate data members (variables) 
and member functions (methods) that operate on that data into a single entity 
called a class. Object oriented programming methods also provide a means to 
create new classes based on existing classes. 

15 An object is an instance of a class. The data members of an object are 

attributes that are stored inside the computer memory, and the methods are 
executable computer code that act upon this data, along with potentially 
providing other services. The notion of an object is exploited in the present 
invention in that certain aspects of the invention are implemented as objects in 

20 one embodiment. 

An interface is a group of related functions that are organized into a 
named unit. Each interface may be uniquely identified by some identifier. 
Interfaces have no instantiation, that is, an interface is a definition only without 
the executable code needed to implement the methods which are specified by the 

25 interface. An object may support an interface by providing executable code for 
the methods specified by the interface. The executable code supplied by the 
object must comply with the definitions specified by the interface. The object 
may also provide additional methods. Those skilled in the art will recognize that 
interfaces are not limited to use in or by an object oriented programming 

30 environment. 
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High Level Encoder and Decoder Description 
A simplified block diagram of a wavelet transform based image pixel 
encoder is shown in Figure 2, with a corresponding decoder shown in Figure 3. 
While the encoder and decoder are described with respect to image pixel data as 
5 the respective input and output, other data can also be transformed as desired. In 
the embodiment shown, image pixel data is provided to a wavelet transform 
block 210, which operates in a known manner to provide wavelet coefficients to 
a quantization block 220. The wavelet coefficients are in a big block format as 
described in the background section. Quantization is performed by means of a 

10 uniform quantizer, which is controlled by a quantization step defining threshold 
T. This results in the representation of each coefficient falling between the steps 
by the value in the middle of the step. The smaller T, the less loss is incurred in 
the quantization. Thus, the output of block 220 is a series of integer numbers, 
which are quantized wavelet coefficients. As in many other applications, the 

15 quantizer may be based on normal rounding, or in rounding towards zero (also 
known as a quantizer with a "dead zone"). 

A reordering and blocking function or block 230 groups wavelet 
coefficients into clusters of like values. It results in a clustering or grouping 
together of the blocks of frequency coefficients which are most likely to be zero. 

20 The reordering increases the likelihood of groupings of similar data, in the sense 
that the data tends to have a monotonically decaying distribution of amplitudes. 
The first blocks tend to have data of larger amplitude, whereas in subsequent 
blocks the amplitudes of the wavelet coefficients tend to decay. The grouping is 
done by fixing a scanning order, which is data independent. One set of such 

25 grouping is shown in Figure 4, for an example with 64 blocks of wavelet 

coefficients. In Figure 4, low frequency components are placed toward the upper 
left corner of the grouping with an alternation of blocks of coefficients from low- 
high and high-low subbands at each level. Reordering and blocking block 230 
provides a sequence of macroblocks in the scanning order indicated. The first 

30 block, 0, contains all coefficients of level 0 of the wavelet tree. This corresponds 

9 
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to the coarsest resolution. Blocks 0 to 3 comprise all the coefficients of level 1. 
Blocks 0 to 15 comprise all coefficients of level 2, while level 3 comprises 
blocks 0 to 63. Note that the blocks alternate from low-high and high-low 
subbands at each level, with low-high being the top of the sequence. In the 
5 Mathematical Description section below we will discuss the advantages of that 
particular ordering. Other orderings are possible as will be seen by one skilled in 
the art, but the above ordering appears to work better than others. The bits are 
then encoded sequentially, starting at the most significant bit. 

An adaptive encoding block 240 receives the macroblocks and encodes 

10 them in a lossless manner. The clustering of the blocks provide data to compress 
which has large clusters of zeros. Further reordering the data by encoding on a 
bit plane basis increases the likelihood of finding large strings of zeros. Starting 
with the most significant bit for the first bit plane leads to a higher likelihood of 
a long string of zeros. Further, this also ensures that the most relevant data is 

15 encoded first. By the time the third or fourth bit planes are encoded, the odds are 
about equal for a zero as opposed to a one, and straight binary encoding may be 
effectively used. 

The encoder is an adaptation of a Golomb-Rice encoder with adaptive 
run-length modifications. In simple terms, a string of 2 zeros is represented by 

20 the codeword consisting of a single bit equal to zero. The length of the string of 
zeros represented by the zero codeword is controlled by the parameter k, which 
is varied as data is encountered, based on the observed frequency of zeros. 
When a zero value is encoded, it is assumed that zeros are more likely, and so 
the value of the parameter k is increased. When a nonzero value is encountered, 

25 k is decreased. By . controlling the amount of such increase and decrease 
appropriately, the encoder can track well a string of bits with a varying 
probability of zero, without the need of the overhead of actually estimating that 
probability. A feedback loop 245 is used to represent the backwards adaptive 
nature of the encoder 240. This encoding provides for efficient compression 

30 and fast adaptation to changes in the statistics of the incoming data. Encoder 

10 
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240 provides a bit stream out which is effectively progressive in that the most 
relevant information is provided at the beginning of the bit stream. Since the 
least significant bits are encoded in the last bit plane, for lower resolution bit 
streams, they may effectively be discarded or not encoded, as represented at a 
5 resolution fidelity block 250. This is usefiil for lower bandwidth transmissions 
of data. 

Decoding, as shown in block form in Figure 3 is essentially the reverse of 
the encoding and data transformations. A bit stream of encoded data, such as 
that produced by the encoder of Figure 2 is received at a lossless adaptive 

10 decoding block 310. The bit stream may be received directly from the decoder, 
from local storage, or from a remote decoder or storage via one of many viable 
transmission media such as by satellite transmission, cable transmission or other 
network. Decoding block 3 1 0 receives the rules developed during encoding via 
a feed forward line 315. Block 310 essentially receives the string length to be 

15 used, and reconstructs the data in accordance with the rules. Again, it operates 
on a block level, but this is not a requirement of the invention. It simply makes 
it more convenient than working with an entire representation of an image or 
other data all at the same time, which would require a larger amount of memory, 
or paging if such memory was not available. One form of fidelity reduction 

20 may be performed at block 3 10 just by not decoding the last bit in the bit plane. 
This effectively doubles the step size controlled by the parameter T. It is a 
simple way to reduce the fidelity of the data. 

The data out of block 310 should be identical to the integer data coming 
out of block 230. However, higher resolution layers of the image at 320 may be 

25 removed at this point as indicated at block 320, just by effectively not using the 
higher frequency wavelet coefficients. This would be useful if the window used 
to display an image or set of images is small. Block 330 then is used to 
unshuffle or reorder the blocks back to the original positions. The output of the 
reorder block 330 is the integer numbers that need to be remultiplied back at 

30 block 340 by using the step size which is provided by a header in the received bit 

11 
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stream. This provides reconstructed wavelet coefficients. The header also 
provides information about how big the image size is, and other standard image 
format data. An inverse wavelet transform is then performed in a known manner 
at 350. It should be noted that the only losses, other than selected desired 
5 fidelity or resolution reductions, are incurred in the quantization steps, which is 
controllable by modification of the T parameter. 

The resolution reduction option block 320 may operate in a few different 
ways. One way to remove the data is by zeroing the integers involved. A 
further way to reduce the resolution is to modify the operation of unshuffle block 

10 330, which may be instructed to zero the values at a desired point. By telling 
both unshuffle block 330, and inverse wavelet transform block 350 where the 
zeros start, they may be easily modified to eliminate unneeded processing of 
actual data at such points. 

The adaptive encoding and decoding of the present invention operates 

1 5 very well on data that has clustered zeros with statistics that change. This type 
of data may also be characterized as having a high probability of data with near 
exponential decay of the probability on either side of the zeros. Multimedia 
data, such as static image data and video has this characteristic. Further, the 
transformation of many types of physical data also has this type of characteristic. 

20 When capturing physical data, the information normally occurs in just a few 

places, which means that most of the other data is zero. Symmetry of the data is 
also a desired characteristic for this type of encoding to work best. In other 
words, an exponential fall off of both negative and positive values on either side 
of an information spike is beneficial. Examples of such physical data include 

25 ECGs and other biometric type of data. 

Mathematical Description of Encoding 

A mathematical description of the transformations and encoding and 
decoding discussed above with respect to Figures 2 and 3 is now provided. The 
following steps define the encoding algorithm: 
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1. Given an image array x(m, w), m = 0, 1, M- 1, n = 0, 1, N- 1, 
compute its wavelet transform coefficients A"(r, s), r = 0, 1 , . . . , Af - 1 , 5 

= 0, 1, ...,7V- 1. 
2. Each coefficient A(r, 5) is quantized according to 

5 q(r y s) = sgn(X(r,s))[\X(r 9 s)\/T] . (1) 

where sgn(-) is the usual signum function and T is a quantization 
threshold. This step maps the continuous wavelet coefficients X(r 9 s) into 
a sequence of integers q(r, s). This is the only step that introduces 
information loss. 

10 3. The quantized coefficients are reordered and grouped into blocks 
according to 

"k (0 - q{r k + mod(/, M B ), s k + [// M B J) 

for/ = 0, 1, ...,Z,-1 and ^ = 0, 1, ...,AT- 1, where L = M B N B is the 
block size, AT = MNjL is the total number of blocks, and M B and 7/ 5 are 

15 defined by M B = Af/ 2 y and AT 5 = n/2 j . The parameter J controls the f 

size of the rectangular blocks of quantized coefficients that are grouped in 
u k (I) , and hence the block size. 

For each K the top left comer indices (r*, s*) are defined according to the 
scan order previously described. 

20 4. The blocks are grouped into macroblocks U, of fixed size LK B , in the form 
Ui = {**(/)}, with k = i K B , i K B + I7 . ..yiK B +K B -\. For each - 
macroblock, its bit planes are successively quantized according to the 
adaptive Run-length/Rice (RLR) coder. The binary encoding of the 
number of bits used by the RLR code for £/, followed by the actual RLR 

25 output bits is appended to the output bitstream. 
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The following steps are then used to decode the PWC bitstream: 

1 . Decode the RLR-coded bits in macroblocks £/,-, for / = 0, 1 , . . I max -1 . If 
Imax < AT, a lower resolution version of the wavelet coefficients is 
recovered. Note that within each macroblock just the first few bit planes 
are decoded, given the desired reconstruction accuracy. All bits in the bit 
planes of q(r 7 s) that are chosen not to decode are set to zero. Resolution 
scalability is achieved by choosing I max < K> whereas fidelity scalability is 
achieved by decoding only a subset of the bit planes for each macroblock. 

2. After recovering the q(r, s\ the wavelet coefficients are reconstructed by 



10 X(r,s) = < 



0, ?(r,s) = 0 

T[q{r 9 s) + \/2\ q(r,s)>0 (3) 
T[q{r 9 s)-\I2\ q(r,s)<0 



It should be noted that the quantization rule in (2) combined with the 
reconstruction rule in (3) comprise a uniform quantizer with a dead zone 
around the origin, which is close to being optimal for minimal-entropy 
scalar quantization of random variables with Laplacian (double-sided 
15 exponential) probability distributions. 

To reorder the wavelet coefficients, as described in Step 3 of the PWC 
encoder, the sequence of top left corner indices (r*, Sk) is defined. The scanning 

order depicted in Figure 4, where M B = M/ 2 j and N B = n/ 2 j control the size 
of each block is used. The parameter J should be chosen such that block zero 

20 contains precisely all wavelet coefficients at the coarsest resolution, e.g. all 
scaling function coefficients. Therefore, J should be equal to the number of 
resolution levels (the tree depth) used in the wavelet transform. It is easy to infer 
from Figure 4 the sequence of all top left comer indices (r*, Sk). 

It is clear from Figure 4 that in order to decode a complete set of 

25 coefficients at any desired level resolution, it is desirable to use all blocks from 
index 0 up to K max -1, where K max is a power of four. Therefore, in Step 1 of the 
PWC decoder, I max -1 is chosen such that K max is a power of four. 
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The reason for the alternate scanning of the low-high (LH) and high-low 
(HL) wavelet coefficients within the same resolution level is simple. Assuming 
the original image has a particular feature (or no feature) at some spatial 
location, it is likely that clusters of both the LH and HL subbands, corresponding 
5 to that location, will have large (or small) values. Therefore, by ensuring that 
pairs of blocks from the LH and HL subbands corresponding to the same spatial 
location appear contiguously in a macroblock or at least proximate or close to 
each other, we're more likely to create clusters of large and small values. That 
increases the probability of long runs of zeros in the bit planes of the quantized 
10 coefficients. 

A flowchart in Figure 7 describes an algorithm used to write the blocks 
of coefficients in the order shown in Figure 4. The algorithm may be 
implemented in computer program instructions, or in hardware, firmware or a 
combination of all as desired. The algorithm is entered at start block 710. An 

15 input matrix Q containing MxN quantized wavelet coefficients is read at 715. 
The coefficients are such as those provided by quantization block 220. A 
number of wavelet levels is defined at 720 in a known manner as JW. At block * 
725, a block size is defined asNHx NV, with NH equal to M/(2 JW ) and NV 
equal to N/(2 JW ). The first output block is then written at 730, and IH and IV are 

20 initialized as NH and NV respectively for use in defining loops for writing of 

further blocks, which are larger in size. For a simplified example, assume that in 
Figure 4, the matrix Q is 1 6 by 16, with 4 levels, and a block size of 1 . This 
provides an initial IH and IV of 1. In further examples, the block size is larger, 
such as 8x8 or 16x16, or even higher. 

25 A decision block 740 is used to determine if the entire matrix of 

coefficients has been written by checking to see if IH is less than M. If IH is still 
less than M, more coefficients need to be written. As seen in Figure 4, the first 
blocks of coefficients are of dimension 1 by 1, and then they increase to 2 by 2 
and 4 by 4 etc. The next sets of flowchart blocks are used to write the 

30 succeeding blocks by looping from one to a block size parameter NBLK which 
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is set at block 745 as IH/NH. A nested loop defined at 750 using I and 755 
using J is used to control the order of writing of the output blocks LH and HL at 
760. J is incremented at NEXT statement 762, while I is incremented at NEXT 
statement 764. This results in rows of the blocks being written first in this 
5 particular implementation. Columns may also be written first if desired, or any 
other order of writing may be used. For the first time through the loop, given a 
matrix of size 16 by 16 and 4 levels, NBLK is also 1, so only blocks 430 and 440 
are written. 

Following the writing of the next LH and HL blocks, a second set of 

10 nested loops is set up at 770 and 775, again using I and J to define positions in 
which to write an output block at 780. This output block corresponds to HH 
blocks at the same level, which is block 450 for the first time through. NEXT J 
and NEXT I statements complete the nested loop at 782 and 784 respectively. It 
should be noted that the HH block could also have been written at the same time 

15 as the LH and HL blocks above since the nested loops are identical. After all the 
blocks at this level have been written, IH and IV are incremented as exponents of 
2 at 790, and then compared at 740 to see if IH is still less than M. If IH is not 
less than M, the algorithm is exited at 795, after having provided at complete 
reordered set of wavelet coefficients in accordance with the present invention. 

20 The second time through the nested loops, blocks 455, 460 and 470 are 

written, followed by blocks 480, 475 and 490 the third time through the nested 
loops. Larger matrix sizes with higher levels are also contemplated. 

To recover the original order for decoding purposes, one can simply read 
the output of the reordering algorithm in the same manner in which it was 

25 written. All that is required is knowledge of the size of the original matrix, and 
the number of levels that were written. Then the writing order is simply reversed 
to provide the coefficients in the original order. A direct mapping may also be 
used, but would require significant additional bandwidth to provide. 
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Details of Bit-Plane Encoding 
The process performed by encoding block 240 can be easily understood 
with the help of the diagram in Table 1. The bit planes are just the sequences of 
5 bits of a particular index in the binary representation (magnitude + sign) of the 
incoming quantized wavelet coefficients or other data. For example Table 1 
shows the bit planes for the sequence of values {9, -6, 1, 0, -2, 3, -4, -1, 2}. In 
the table, bit plane 4 is the sequence {100000000}, bit plane 3 is the sequence 
{010000100}, bit plane 2 is the sequence {01001 1001}, and bit plane 1 is the 
10 sequence {101001010}. 



Table 1 - Bit plane decomposition of integer data 



DATA VALUES 


-> 


9 


-6 


1 


0 


-2 


3 


-4 


-1 


2 


SIGN BIT 


— > 


0 


1 


0 


0 


1 


0 


1 


1 


0 


BIT PLANE 4 


— > 


1 


0 


0 


0 


0 


0 


0 


0 


0 


BIT PLANE 3 


— > 


0 


1 


0 


0 


0 


0 


1 


0 


0 


BIT PLANE 2 


— > 


0 


1 


0 


0 


1 


1 


0 


0 


1 


BIT PLANE 1 




1 


0 


1 


0 


0 


1 


0 


1 


0 



15 ■ - . 

In the input data in Table 1, values of smaller magnitude seem to be more 
likely to occur, which is also typical of quantized wavelet data and finite 
alphabet data. One can see from the patterns above that the higher bit planes 
tend to show a higher frequency of zeros, because input values of higher 

20 magnitude are less likely. Bit plane 1 (the least significant bit) and the sign bit 
plane typically have zeros and ones with approximately equal frequency. 
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The flow chart in Figure 5 describes the algorithm for efficiently 
encoding the incoming data through bit planes starting at 505. The bit planes are 
first read from an input buffer x at 510 which contains N numbers. The number 
of bits planes, bmax, is computed at 515, and a significance flag vector sflg is set 
5 to all zeros at 520. 

At 525, the bit plane index variable bit is set equal to bmax, so encoding 
starts with the most significant bit plane. The values of the bits pointed to by the 
index "bit" form the bit plane vector bp at 530. For each plane bp, the bits are 
divided into two subsets as indicated at blocks 535 and 540. xl correspond to 

10 positions for which a 44 1" entry has not been seen in the higher planes - those are 
called significant bits. x2 corresponds to positions for which a "1" has already 
been seen in the higher planes - those are called refinement bits. y 

At block 545, xl is encoded with the adaptive run-length Golomb-Rice 
(ARLGR) encoder which will benefit from the higher frequency of zeros in xl. \ 

15 For every bit equal to 1 in xl, the sign bit is also encoded and appended at the 

end of the output code. ^ 

At block 550, x2 is encoded with straight binary encoding. This is done 
by appending the x2 bits to the output stream. Minimal loss in encoding 
efficiency is encountered because zeros and ones are usually equally likely in x2. 

20 Note that the sign bits are not referred to as a bit plane because they are 

not processed as a bit plane. The sign bits are sent in the process of coding the xl 
vectors of each bit plane. Thus, we can also think of the vector xl as being 
drawn from the alphabet {0, +1, -1 }, i.e. bit plus sign. 

An important property of the flow chart in Figure 5 is that the 

25 information on which are the bits that belong to xl and which are the bits that 
belong to x2 does not need to be explicitly encoded. The vector sflg controls the 
allocation of bits to xl , and sflg is first initialized to all zeros, and then updated 
after each bit plane is encoded at 555. Therefore, the decoder can easily track the 
changes to sflg. To continue to the next bit plane, bit is decremented at 560 and 

30 checked to see if the last plane has been decoded at 565. If not, control goes to 
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block 530 for encoding of the next bit plane. If bit was equal to zero, or a higher 
number if a lower resolution coding is desired, an output buffer containing 
outputs of all xl and x2 encodings is written at 570 and the process ends at 575. 
The adaptive Run-length + Golomb-Rice (ARLGR) coder is where the 
5 encoding gain resides. It maps long vectors xl with lots of zeros in a more 
compact code, with fewer zeros. The ARLGR encoder can be used to encoding 
binary sequences with or without associated sign bits, as shown below. In order 
to understand the ARGLR encoder, first consider the basics of the run-length 
encoding and Golomb-Rice coding. 

10 In its general form, the basic idea behind run-length (RL) coding is to 

replace long strings of the same value in an input data vector by a code that 
specifies the value to be repeated and how many times the values should be 
repeated. If such repetitive strings are long enough and frequent enough, RL 
coding leads to a significant reduction in the number of bits needed to represent 

1 5 the data vector. 

RL coding can be applied to the encoding of binary data in which either 0 
or 1 is significantly more likely to occur. One example is in graphics files, for 
example, a digitized black drawing on a white background. If white picture 
elements (pixels) are represented by a bit equal to 0 and black dots by a bit equal 

20 to 1, it's clear that zeros are much more likely to occur. In fact, many standard 
graphics file formats use RL coding. 

In 1966 Golomb proposed a simple code for the representation of 
positive numbers. It was later shown that the Golomb code is indeed optimal 
(minimum expected length) if the numbers are drawn from a source with 

25 geometric probability distribution, i.e. if Prob {x = n) = ab n , where a and b are 
parameters. A few years later Rice independently derived a subset of the Golomb 
code that is very easy to implement in practice. These codes became known as 
Golomb-Rice codes. 

In the present invention the Golomb-Rice codes for a source of binary 

30 digits are combined with RL codes. The resulting Run-Length = Golomb-Rice 
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code is shown in Table 2. The code is characterized by a parameter k, which 
controls the length of the run associated to the codeword 0; this maximum run 
length is equal to 2*. 

Table 2 - Run-Length + Golomb-Rice encoding of a source generating symbols 

e {0,1} 



K 


INPUT 
STRTNG 


OUTPUT K 

BINARY 

CODE 


INPUT 
STRING 


OUTPUT 
BINARY 
CODE 


0 


0 


0 3 


00000000 


0 




1 


1 


1 


10000 


1 


00 


0 


01 


10010 




1 


10 


001 


10011 




01 


11 


0001 


10101 


2 


0000 


0 


00001 


10111 




1 


100 


000001 


11000 




01 


101 


0000001 


11010 




001 


110 


00000001 


11100 




0001 


111 



10 For encoding of the xl vector in the bit-plane encoder described earlier, 

we need to append the sign to the codeword of each nonzero bit. For that, a 
simple extension of the RLGR code is used as shown in Table 3. 
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Table 3 - Run-Length + Golomb-Rice encoding of a source generating symbols 
e {0.+1.-1} 



K 


INPUT 


OUTPUT k 


INPUT 


OUTPUT 




STRING 


BINARY 


STRING 


BINARY 






CODE 




CODE 


0 


0 


0 


00000000 


0 


+1 


10 


+1 


10000 


-1 


11 


-1 


10001 


1 


00 


0 


0+1 


10010 


+1 


100 


0-1 


10011 


-1 


101 


00+1 


10100 


0+1 


110 


00-1 


10101 


0-1 


111 


000+1 


10110 


2 


0000 


0 


000-1 


10111 


+1 


1000 


0000+1 


11000 


-1 


1001 


0000-1 


11001 


0+1 


1010 


00000+1 


11010 


0-1 


1011 


00000-1 


11011 


00+1 


1100 


000000+1 


11100 


00-1 


1101 


000000-1 


11101 


000+1 


1110 


0000000+1 


11110 


000-1 


1111 


0000000-1 


11111 



For a given source of input vectors, using either the {0,1 } or the {0,+l,- 

1 } alphabets, the parameter k should be chosen in order to minimize the 
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expected code length. If the source has no memory, has constant statistics over 
time, and is characterized by Po = Prob {symbol = 0}, then it is easy to compute 
the optimal value of k as a function of Po. 

In practice, however, binary (or binary + sign) vectors are not stationary. 
5 Typical examples include data obtained from the physical world, such as 

quantized wavelet coefficients of pictures or scanned documents. Therefore, we 
need to adjust the RLGR parameter k over time, to best match the local statistics 
of the data. Many strategies have been considered, mostly involving dividing 
the input data in blocks of appropriate length. For each block, P 0 is estimated 
10 and then the optimal value of k is computed. An additional code is then sent at 
the beginning of each block to indicate the value of k that should be used by the 
decoder. 

The encoder 240 takes a new approach. A backward-adaptive strategy is 
used for changing the RLGR parameter k. By backward-adaptive, it is meant that 

15 variations in k are computed based on encoded symbols, not directly on the input 
data. The basic strategy is that the value of k to be used in encoding the next 
symbol should depend only on previously encoded data. Therefore, all the 
decoder needs to do to recover the changing values of k is to apply the same 
adaptation rule as the encoder. Therefore, to simplify decoding it is important 

20 that such a rule be as simple as possible to compute. 

The new adaptive Run-Length + Golomb-Rice (ARLGR) encoder 240 
uses the following rules for changing the parameter k. Several parameters are 
first defined at block 604. A scale factor L is first defined and is used to define 
kp as L*k. kp is an auxiliary parameter whose value moves up or down by an 

25 amount Up or Dn respectively to permit fractional moves of k without the use of 
floating-point arithmetic. Finally, Uq is defined and used to move kp up if the 
output code was zero and k was equal to zero. An input buffer x is read at 606, 
and contains M numbers. At 608, k is set to kO, kp is set to L*k and run is set to 
0. The process is started with a value of k that is a good choice for the long-term 
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statistics of the incoming data, e.g. k = 2. Starting with the first symbol, 
xindex=l at 610, symbol is set to x(xindex) and runmax is set to 2 k . 

As an overview of the encoding process, after encoding a source symbol, 
kp is adjusted based on the emitted output code. If the output code was 0 and k 
5 * 0, kp is incremented by a predefined increment step Up, i.e. set kp = kp + Up. 
If the output code was 0 and k = 0, kp is incremented by a predefined increment 
step Uq, i.e. set kp = kp + Uq. 

If the output code started with a 1 (corresponding to a nonzero input), kp is 
decremented by a predefined decrement step Dn, i.e. set kp = kp - Dn. The 

10 value of k for encoding the next input symbol is set to k = Lkp/L J (i.e. truncate 
kp/L down to the nearest integer. 

The algorithm is based in a simple strategy. If a run of zeros is 
encountered, k is increased to allow for longer sequences of zeros to be captured 
by a single output bit = 0. If a nonzero symbol is encountered, k is reduced to 

15 avoid excessively long output codes. The use of the auxiliary parameter kp and 
the scale factor L above allows adjustment of k in fractional steps without having 
to use floating-point arithmetic as indicated above. 

For most of the data tested in the ARLGR encoder, the performance was 
quite good (encoded rates very close to source entropies), for the following „ 

20 typical choice of parameters: L = 4, Up = 4, Dn = 5, and Uq = 2. In some cases, 
adjustments on these parameters can lead to slightly better performance. " 

Returning to the description of the flowchart in Figure 6, following 
initialization and defining of parameters as described above with reference to 
blocks 602, 604, 606, 608, 610 and 612, k is first checked at 614 to see if it is 

25 equal to zero. If it is, and if symbol is zero, Uq is added to kp at 61 8. A zero is 
appended to the output buffer at 620 and if kp is out of range - above kpmax - at 
622, it is clipped. At 624, k is set to the largest integer less than kp/L, the scale 
factor. Xindex is then incremented, and if less than M as determined at 628, the 
next symbol is selected at 612. If greater than M, the output bit buffer is written 

30 to at 630 and the process ends at 640. 
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Referring back to decision block 616, if symbol was not equal to zero, a 
1 is appended to the output bit buffer at 642, and a sign bit of symbol is 
appended to the output bit buffer at 644 if the data has a sign bit, and processing 
continues at 622 to check to see if kp is within range. 
5 If A: is not equal to 1 at block 614, a further check of symbol is performed 

at 650. If the symbol is not equal to zero, a 1 is appended to the output bit buffer 
at 652 and a k-bit value of run is appended to the output bit buffer at 654. At 
656, Dn is subtracted from kp, and processing continues at 644, where an 
optional sign bit is appended. 

10 If symbol is found to be zero at 650, run is checked at 622 to see if it is 

equal to runmax. If not, kp is clipped to not exceed kpmax at 622. If run was 
equal to runmax at 662, a zero is appended to the output bit buffer at 664, and 
run is set to zero at 666. Finally, Up is added to kp, and processing again reverts 
to block 622 for clipping of kp, setting of k at 624, incrementing xindex at 626 

15 and checking to see if the last symbol has been processed at 628. If so, the 

information is written to the ouput bit buffer at 630 and the process is ended at 
640. 

In Table 4 results of using the bit plane encoder on quantized wavelet 
coefficients are shown. Note that the simple bit-plane encoder performs better 
20 than the adaptive arithmetic encoders (which are considered the state-of-the-art), 
in spite of being computationally simpler. 
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Table 4 - Output code length in bytes for quantized and reordered wavelet 

coefficients as input. 



Data Set 


Bit-plane 


Adaptive 


Adaptive ELS 


(length = 30,000 values) 


encoder of 


arithmetic encoder 


encoder 




this invention 






Wavelet data, low 


8,359 


12,748 


12,129 


frequency, 








Wavelet data, medium 


4,906 


5,608 


5,022 


frequency 









5 

A major advantage of the encoder, not shared by the arithmetic encoders, 



is scalability. With the described bit-plane encoding, a lower fidelity version of 
the signal can be easily obtained by stopping the decoding process at a bit plane 
higher than plane 1. That allows for progressive transmission and reconstruction 
10 of the information, and important feature for communication channels such as 
the Internet. Another application of scalability is in digital cameras, for example. 
If the user wants to take more pictures and is willing to sacrifice quality of 
pictures already stored, lower bit planes of existing images can be removed to 
release storage for new pictures. 
15 Although the ARLGR encoder is described in conjunction with its use in 

a bit plane encoder, it can be quite useful as a general-purpose encoder for - 
binary data in which the value 0 is much more probably than the value 1 . This 
is especially true in cases where the probability distribution is constantly 
changing. For example, consider the problem of encoding a black-and-white 
20 drawing scanned at a resolution of 480 x 640 pixels. Assuming the mapping 
white = 0 and black = 1, the ARLGR encoder may be applied directly to the 
data. However, encoder 240 does not handle runs of Is very well, and so a 
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difference operator is first applied across all rows of pixels. Starting with the 
second row and moving down, each pixel value is replaced by 0 if it has the 
same color as the same pixel in the row above, or 1 if it has a different color. 
This is repeated across columns. The resulting bits are encoded with the 
5 ARLGR encoder 240. 

This provides a mapping of runs of either white or black into runs of 
zeros, without any loss of information. That makes the data more suitable for 
ARLGR encoding. Table 5 shows a comparison of the performance of such a 
simple encoder with other approaches. 

10 

Table 5 - Output code length in bytes for encoding typical black-and white 



picture data. 



ARLGR 


CCITT fax 


Adaptive ELS 


Adaptive 


encoder 


standard 


encoder 


arithmetic encoder 


described 








3,294 


5,926 


5,331 


3,393 



The ARLGR encoder 240 algorithm outperforms the standard fax 

15 encoding algorithm by almost a factor of two. It uses only 55% of the bytes 

used by the fax algorithm. In fact, the new ARLGR-based encoder even 

surpassed the state-of-the-art adaptive arithmetic encoder by a small margin for 

this particular image. In addition, it had the lowest computational complexity. It 

should be noted that this is just one example, and that the results may vary 

20 depending on the image used and tuning of parameters. 

In Figure 8, a block diagram of a suite of office programs is shown 

generally at 810. One particular office suite comprises a plurality of high level 

applications indicated at 812, including such applications as word processing, 

email, spreadsheet, presentation tools, photo manipulation programs, and 

25 browsers. Supporting these applications are at least two lower level software, 

hardware or a combination thereof functions at 826 and 818. The functions 
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shown include a video in/out function 826 and a fax/scanner function 818. 
Many other functions may also reside at this level. 

In particular, the video function provides the ability to both display video 
and receive video and image data from external sources. The video and 
5 fax/scanner functions make use of the encoder and decoder described herein and 
indicated at block 832 to provide encoding and decoding functions as previously 
described. If raw image or other suitable data is captured in pixel or other form, 
the encoder 832 is used to encode it. Further, if encoded data is obtained from 
any source employing the type of encoding described here, the decoder at 832 is 
10 called by the application receiving it to transform or decode it to a displayable or 
useable format. 

It should be noted that many of the applications which may comprise 
such an integrated office suite, such as Microsoft Office or follow-on products "~ 
that may integrate even more applications are more and more likely to deal with 

15 data that needs to be compressed or decompressed. The present invention 
provides an alternative to other forms of coding which removes the blocking 
artifacts present in JPEG, and is less complex to implement in either software, 
hardware or hybrid forms as desired. The encoder/decoder at 832 is also easy to 
integrate into such an office suite. 

20 Conclusion 

Reordering of quantized wavelet coefficients is performed to cluster large 
and small wavelet coefficients into separate groups without requiring the use of 
data-dependent data structures. The coefficients are then adaptively encoded 
based on a run-length code which continuously modifies a parameter that 

25 controls the codewords used to represent strings of quantized coefficients, 
seeking to minimize the number of bits spent in the codewords. Since the 
ordering pattern is fixed, and the coefficient encoding does not require a 
modified table for each image, the invention lends itself to easier hardware or 
software implementations. Further advantages include the elimination of 
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blocking artifacts, and single pass encoding for any desired compression ratio for 
image data. 

A decoder is described which applies the above encoding and blocking in 
reverse order. Decoding of the encoded coefficients is first performed, followed 
5 by an unshuffling of the coefficients. The unshuffled coefficients are then 
subjected to an inverse wavelet transform to recover the transformed and 
compressed data, such as image pixels. Adaptive arithmetic coding may also be 
used in conjunction with the reordering to obtain similar compression benefits, 
but with slightly higher complexity. 
10 By not requiring the use of data-dependent data structures such as zero 

trees, or a separate list for set partitions in trees, hardware implementations are 
easier to build. This application is intended to cover any adaptations or 
variations of the present invention. It is manifestly intended that this invention 
be limited only by the claims and equivalents thereof. 
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What is claimed is: 

1 . A method of encoding finite alphabet data characters, the method 
comprising: 

5 initializing a string length for an adaptive run-length encoder; 

modifying the string length based on an encoding parameter; and 
modifying the encoding parameter based on already encoded characters. 

2. The method of claim 1 wherein the encoding parameter is increased each 
10 time an expected character is encountered. 

3. The method of claim 2 wherein the expected character is zero. 

4. The method of claim 1 wherein the encoding parameter is decreased each 
15 time an expected character is not encountered. 

5. The method of claim 4 wherein the expected character is zero. 

6. The method of claim 1 wherein the characters are encoded on a bit plane 
20 basis. 

7. The method of claim 6 wherein the least significant bit planes are 
encoded using binary encoding. 

25 8. The method of claim 6 wherein only the most significant bit planes are 
encoded to modify the resolution of the encoding. 

9. The method of claim 1 wherein a string of zeros is represented by a small 
value. 
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10. The method of claim 1 and further comprising representing strings by 
values which are generally shorter for longer strings, and providing a progressive 
encoded bit stream. 

5 

11. A computer readable medium having instructions stored thereon causing 
a computer to perform the method of claim 1 . 

12. A method of encoding finite alphabet data comprising: 
initializing a string length for an adaptive run-length encoder; 
modifying the string length by a power, k, wherein k is an adaptive 

parameter; 

increasing the adaptive parameter k each time a zero is encoded; and 
decreasing the adaptive parameter k each time a one is encoded, wherein 
string lengths are represented by symbols that are encoded on a bit plane basis. 

13. The method of claim 12 wherein k is an integer, and wherein a scaling 
factor, L, is used to increment the encoding parameter a fraction at a time. 

20 14. The method of claim 12, wherein the data comprises wavelet transform 
coefficients. 

15. The method of claim 12, wherein the data is representative of biometric 
measurements. 

25 

16. The method of claim 12, wherein the data is representative of a scan of 
an image to be faxed. 

17. An encoder for encoding finite alphabet data comprising: 

30 means for initializing a string length for an adaptive run-length encoder; 

means for modifying the string length based on an encoding parameter; 

and 
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means ror mocnrying tne encoding parameter based on already encoded 

data. 

1 8. A computer readable medium having a data structure stored thereon that 
5 represents a compressed image for decoding by a computer, the data structure 

comprising: 

a plurality of values representing strings of ones and zeros; and 
an encoding parameter associated with selected strings of zeros based on 
previously encoded data which is used to modify the length of a selected string. 

10 

19. A decoder for decoding the data encoded by an encoder of claim 1 8. 

20. A method of decoding finite alphabet data characters which have been 
encoded using a backward adaptive run length encoding, the method comprising: 

15 initializing a string length for an adaptive run-length decoder; 

modifying the string length based on a decoding parameter; and 
modifying the decoding parameter based on already decoded characters. 

2 1 . The method of claim 20 wherein the decoding parameter is increased 
20 each time an expected character is encountered. 

22. The method of claim 22 wherein the expected character is zero. 

23 . The method of claim 20 wherein the decoding parameter is decreased 
25 each time an expected character is not encountered. 

24. The method of claim 23 wherein the expected character is zero. 

25. The method of claim 20 wherein the characters are decoded on a bit 
30 plane basis. 

26. The method of claim 25 wherein the least significant bit planes are 
decoded using binary encoding. 
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27. The method of claim 26 wherein only the most significant bit planes are 
decoded to modify the resolution of the encoding. 

5 28. The method of claim 20 wherein a string of zeros is represented by a 
small value. 

29. A computer readable medium having computer executable instructions 
stored thereon for causing a computer to perform a method of decoding finite 

10 alphabet data characters which have been encoded using a backward adaptive 
run length encoding, the method comprising: 

initializing a string length for an adaptive run-length decoder; 
modifying the string length based on a decoding parameter; and 
modifying the decoding parameter based on already decoded characters. 

15 

30. A decoder for decoding finite alphabet data comprising: 

means for initializing a string length for decoding adaptive run-length 
encoded data; 

means for modifying the string length based on a decoding parameter; 

20 and 

means for modifying the decoding parameter based on already decoded 

data. 
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